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Adaptive  Information  Fusion  in  Asymmetric  Sensemaking  Environment 


Abstract 

The  existing  sensemaking  models  for  traditional  force-on-force  battlefield  infonnation 
management  rarely  survive  the  kinds  of  information  in  asymmetric  battlespace 
environments.  By  combining  the  abduction  process  and  Bayesian  probability  network 
formalisms,  we  propose  a  Bayesian  Abduction  Models  (BAM)  to  support  in  the 
sensemaking  process  of  evaluating  multiple  hypotheses  in  the  context  of  changing 
information.  This  paper  describes  a  Bayesian  network  that  captures  abduction  logic 
primitives  from  a  kernel  of  disparate  information  sources.  We  use  a  genetic  learning 
algorithm  to  solve  BAM  information  fusion  problems.  We  show  how  the  model  can  be 
used  in  prospective  and  retrospective  sensemaking  conditions  to  simulate  the  ways 
commanders  and  the  battle  staffs  process  infonnation. 

Introduction 

Consider  the  current  military  conflicts  in  Iraq  and  Afghanistan.  The  adversary 
environment  is  known  to  be  complex,  “  wicked”  and  completely  asymmetric— the 
adversaries  are  barely  known,  and  their  tactics  keep  changing  against  the  coalition  forces. 
The  deliberate  military  decision  making  processes  (MDMP)  with  all  their  linearity 
assumptions  collapse  immediately  in  contact  with  asymmetric  information  environments. 
Generating  courses  of  action  must  be  progressive  and  opportunistic— the  usual  analytical 
models  of  judgment  and  choice  that  fit  force-on-force  tactics  must  be  recalibrated  to  fight 
against  unknown  enemies.  Sensemaking,  the  process  of  connecting  dots  to  disparate 
information  and  seeking  explanation  to  potentially  unexpected  evolving  situations,  has 
been  suggested  as  an  embellishment  or  precursor  to  existing  MDMP.  Unfortunately, 
these  nascent  decision  systems  lack  analytical  models  that  can  capture  the  evolving  states 
of  battle  dynamics  and  its  information  equivocality.  The  proposed  method  seeks  to 
minimize  this  problem  by  developing  a  probabilistic  abduction  model  for  sensemaking 
process. 

To  help  elucidate  our  point  of  discourse,  consider  a  fictitious  case  in  the  current  conflict 
in  Iraq.  We  can  use  a  hypothetical  network  depicted  below  to  illustrate  an  example  of 
analyzing  the  Iraq  insurgency.  The  top  most  variable  H0  will  represent  a  composite 
hypothesis  for  a  desired  end  state  problem.  For  example,  we  can  hypothesize  that, 
according  to  intelligent  speculations,  that  Iran  is  responsible  for  the  sectarian  violence. 
The  variables  A,  form  a  subset  of  H0  and  will  represent  the  operational  focus  (e.g., 
funneling  money  and  weapons  to  insurgents,  covert  operations  in  Iraq,  etc.);  A)  may 
represent  the  perceived  motives  f;  S,  may  represent  the  influence  path  (example:  Al-Sadr 
militia  cell,  Al-Qaeda  cell,  etc.)  responsible  for  attacking  targets  m,  (e.g.,  mosques, 
coalition  forces,  kidnapping,  etc.).  Figure  1  shows  the  network  of  the  information 
described  above. 
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Figure  1 :  Example  network  where  {  hi,Xi,Si,mi}  represent  the  End  state,  Operational 
Focus,  Motivation,  Influence  Pathways,  and  the  Target  variables,  respectively. 

From  a  sensemaking  perspective  we  are  interested  in  knowing  what  happens  when  new 
information  unexpectedly  arrives  to  the  intelligent  analyst.  For  instance: 

1)  The  adversaries  change  their  attack  methods;  and 

2)  New  targets  are  exploited  by  the  adversaries. 

From  the  list  of  possible  hypotheses  and  variables,  the  analyst  is  interested  in  determining 
the  most  probable  explanation,  and/or  making  the  best  inference  from  the  given  evidence. 
The  existing  courses  of  action  and  planning  models  rarely  survive  the  kinds  of 
information  described  above.  Sensemaking  is  suggested  as  a  model  for  situations  with 
ambiguities  such  as  the  one  in  the  above  case;  more  so,  abductive  reasoning  is  suggested 
as  its  supporting  tool.  Abduction  is  a  reasoning  process  that  tries  to  form  plausible 
explanations  for  abnormal  observations.  A  typical  abduction  task  is  a  classification  of  a 
given  data  set  into  potentially  relevant  elementary  explanatory  hypotheses.  By  combining 
the  abduction  task  and  Bayesian  probability  formalisms,  we  have  developed  a  Bayesian 
Abduction  Model  (BAM)  to  support  in  the  performance  analysis  during  a  sensemaking 
process  such  as  illustrated  in  the  sample  case  above. 


Theoretical  Foundation 


Developing  an  abduction  driven  Bayesian  model  of  sensemaking  begs  for  an  important 
question:  “Can  sensemaking  with  all  its  tacit  dimensions  of  knowledge  be  represented 
mathematically  (and  computationally)?  Our  answer  is  definitely  yes,  but  with  a  caution 
about  avoiding  over  generalization. 

Let  us  review  some  of  the  existing  models  developed  to  either  target  sensemaking  or  its 
pseudo-variances.  Computationally,  Schmidt  (1994)  view  sensemaking  as  a  symbolic 
system  of  human  communication  when  he  notes  that  “in  systems  that  hold  and 
manipulate  information,  it  is  possible  for  a  system  to  hold  and  manipulate  infonnation 
that  represents  the  system  itself,  in  such  a  way  that  there  is  a  causal  link  in  both  directions 
between  the  system  and  the  information;  if  the  system  changes  the  information,  the 
system  itself  changes  accordingly.  These  (conditions)  are  self  reference  that  make  goal 
directed  (sensemaking)  systems  symbolic  and  computational  reflective  systems.”  Schank 
(1982)  observes  that  sensemaking  is  a  system  of  actions,  symbols  and  processes  that 
enables  an  organization  to  transform  information  into  valued  knowledge  which  in  turn 
increases  its  long  run  adaptive  capacity  (1982;  pp.8).  Weick  (1995)  notes  that 
sensemaking  is  a  theory  and  a  process  of  how  people  reduce  uncertainty  or 
ambiguity. .  .during  decision  making.  In  DARPA’s  Information  Awareness  Project 
initiatives,  sensemaking  is  considered  an  important  tool  for  the  Future  Combat  Force 
because,  with  fragmentary  battle  space  infonnation,  “meaning  has  to  be  derived  from 
these  fragmentary  cues”. 

Peircean  philosophy  provides  a  foundation  for  understanding  human  reasoning  and 
capturing  behavioral  characteristics  of  decision  makers  due  to  cultural,  physiological,  and 
psychological  effects.  Peirce’s  theory  focuses  on  a  system  of  logic  that  can  achieve  the 
best  possible  conclusions  based  on  the  available  information.  Pierce  (1877)  first 
described  abductive  inference  by  providing  two  intuitive  characterizations:  given  an 
observation  d  and  the  knowledge  that  h  causes  d,  it  is  an  abduction  to  hypothesize  that  h 
occurred;  and  given  a  proposition  q  and  the  knowledge  that  /?—»<?,  it  is  an  abduction  to 
conclude  p.  In  either  case,  abduction  is  uncertain  because  something  else  might  be  the 
actual  cause  of  d,  or  because  the  reasoning  pattern  is  the  classical  fallacy  of  “affirming 
the  consequent”  and  is  formally  invalid.  Additional  difficulties  can  exist  because  h  might 
not  always  cause  d,  or  because  p  might  imply  q  only  by  default.  Generally,  we  can  say 
that  h  explains  d  and  p  explains  q  and  we  shall  refer  to  h  and  p  as  hypotheses  and  d  and  q 
as  data.  Peirce  (1877)  further  defined  the  process  of  inquiry  or  discovery  as  including 
three  fundamental  inference  processes: 

1)  Abduction  generation  of  hypotheses  to  explain  new  anomalous  data. 

2)  Deduction  performs  the  function  of  making  a  prediction  as  to  what  would  occur  if 

the  hypotheses  were  to  turn  out  to  be  the  case. 

3)  Induction  finds  the  ratio  of  the  frequency  by  which  the  necessary  results  of 

deduction  does  in  fact  occur. 


Abduction  is  then,  a  reasoning  process  that  tries  to  form  plausible  explanations  for 
abnonnal  observations.  It  is  distinct  from  deduction  and  induction  in  that  it  is  inherently 
uncertain  since  information  or  data  supporting  abduction  process  is  dynamic  in  nature, 
leading  to  human  construction  of  multiple  and  often  competing  hypotheses. 

Bayes  Theory 

We  have  alluded  to  the  use  of  Bayesian  theory  in  our  proposed  work.  What  follows  is  a 
short  summary  on  the  foundation  of  the  Bayesian  approach  (Pearl,  1995).  In  any  situation 
in  which  we  have  to  make  decisions  we  are  often  interested  in  determining  the  best 
hypothesis  from  some  construct  space  H,  given  observed  data  D.  Bayes  theorem 
provides  a  way  to  calculate  the  probability  of  a  hypothesis  based  on  its  prior  probability, 
the  probabilities  of  observing  various  data  given  the  hypothesis  and  the  observed  data 
itself.  To  define  Bayes  theorem  precisely,  we  first  need  to  define  the  notations  used.  Let 
P(h)  denote  the  initial  probability  that  hypothesis  h  holds,  before  we  incorporate  any  new 
data.  P(h)  is  the  prior  probability  of  h  and  may  reflect  any  background  knowledge  we 
have  about  the  chance  that  h  is  our  atypical  belief  or  a  correct  hypothesis.  If  no  such  prior 
knowledge  exists,  let  P(D)  denote  the  probability  that  evidence  data  D  will  be  observed. 
P(D)  represents  the  probability  of  evidence  D  given  no  knowledge  about  which 
hypothesis  holds.  Let  P(D\h)  denote  the  probability  of  observing  data  D  given  some 
world  in  which  hypothesis  h  holds.  We  are  interested  in  the  probability  P(h\D)  that  h 
holds  given  the  observed  data  D.  P(D)  serves  to  confirm,  reject,  or  modify  our  initial 
belief  about  h.  P(h\D)  is  called  the  posterior  probability  of  h  because  it  reflects  our 
confidence  that  h  holds  after  we  have  seen  some  evidence  D. 

Bayes  theorem  provides  a  way  to  calculate  the  posterior  probability  P(h\D),  from  prior 
probability  P(h), together  with  P(D)  and  P(D\h)  and  can  be  mathematically  stated  as, 

P(„  lD)=wm . (i) 

P(D) 

In  previous  studies,  Pate-Cornell  (2001)  used  Bayesian  analysis  to  study  intelligence 
fusion.  McLaughlin  and  Pate-Cornell  (2005)  used  Bayesian  techniques  to  provide  an 
analytical  illustration  of  Iraq’s  nuclear  program  intelligence.  Sticha,  Buede  and  Rees 
(2005)  developed  APOLLO,  an  analytical  tool  for  predicting  a  subject’s  decision  making. 
Starr  and  Shi  (2004)  conducted  a  study  on  Bayesian  belief  networks  and  their 
applications  to  land  operations  for  the  Australian  military.  So  far,  there  has  been  no 
substantive  study  of  the  application  of  Bayesian  networks  in  sensemaking.  There  are 
several  reasons  for  this.  First,  equation  (i)  above  cannot  handle  well  hypotheses  of 
multiple  disorders  since  Bayesian  models  are  more  grounded  in  diagnostics  decision 
making  process  (Pearl,  1988).  For  example,  given  two  independent  hypotheses  hj  and  h2 
and  a  common  data  set  DltD2,  the  computation  P(Dj\hjAh2)  presents  a  serious 
logical  analysis  challenge.  Secondly,  it  is  difficult  to  handle  causal  chaining  where  there 
is  no  direct  influence;  note  that  the  success  of  Bayesian  Belief  Networks  (BBN),  e.g. 

Pearl  (2000),  is  based  on  the  availability  of  direct  conditional  influences. 


Abduction  and  Bayesian  Model 


The  existing  models  of  abduction  are  purely  from  the  logical  approach  (Konolige, 
1992). Our  model  is  not  for  logical  reasoning.  We  are  interested  in  the  probabilistic 
models  of  uncertainties  that  allow  some  causal  inference  to  take  place  in  a  sensemaking 
information  network.  In  this  case,  the  relationship  between  Bayesian  reasoning  and 
abduction  is  governed  by  the  assertion  related  only  to  a  set  of  plausible  explanations 
(Prakken,  2004).  Simply 

Let  P(w)  =  I  P(E)  (ii) 

Where  E  is  an  explanation  of  world  w 

P(E)  =  PL  5  P(h )  (Assuming  independent  events  E) — (iii) 


P(w\E)  = 


P(w&E) 
P(E ) 


<—  explains  w  &E 
<—  explains  E 


(iv) 


P(w|E)  may  represent,  say,  mass  demonstration  by  Iraqi  citizens  because  of  bombing  of  a 
mosque  by  the  coalition  force.  The  abduction  problem  in  sensemaking  is:  given  E, 
explain  E,  then  try  to  explain  w  from  these  explanations. 


Mathematical  Illustration 


We  briefly  demonstrate  the  Bayesian  abductive  inference  using  a  mathematical 
illustration.  For  simplicity,  inference  is  performed  only  for  a  part  of  the  network  as  shown 
in  Figure  2  below.  We  define  an  end  state  of  the  network  as  a  composite  hypothesis  Ha 
and  to  this  we  assign  a  prior  probability.  The  prior  probability  can  be  assumed  based  on 
the  level  of  past  infonnation  that  we  have  about  a  particular  situation  that  is  of  interest. 
For  example,  H0  could  be  maintaining  stability  operations  in  Bagdad.  The  estimated 
probability  could  be  from  the  news  media,  intelligence  briefings,  or  simply  the 
commander’s  estimate.  We  can  write,  P{H o)  =  0.4 


This  means  that  we  are  only  40%  confident  that  our  chosen  hypothesis  is  plausible.  By 
the  axioms  of  probability,  the  probability  of  an  alternative  hypothesis  P{H a ) 

representing  any  other  end  state  is  therefore,  P{H a )  =  0.6  and  we  need  not  explicitly 

state  this.  Similarly  we  can  assign  apriori  probabilities  for  the  conditional  probabilities  of 
interest  representing  the  probabilities  of  the  children  events,  given  the  parents. 

Next,  we  can  compute  the  prior  probabilities  of  all  the  instantiated  variables  as  follows 


P(hi)=  P(h1\H0)P(H0)  +P(h,\Ha)P(Ha)=  (0.9)(0.4)+(0.8)(0.6)=0.84 
P(Xl)=  P(xi\h/)P(hi)  +  P(xi\h2)P(h2)=(0.7)(0.84)+(0.4)(0.16)  =0.652 


P(S,)=P(S,\xl)P(x,)  +P(S, \x2)P(x2) = (0. 5)(0. 652)  +(0.6)(0.348)  = 0.5348 
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Figure  2:  Example  network  where  {h,  ,x.  Si}  represent  the  End  state,  Operational  Focus, 
Motivation  and  the  Influence  Pathways  variables  respectively 

Now  suppose  the  variable  X  is  instantiated  for  xj.  Since  the  Markov  condition  entails  that 
each  variable  is  conditionally  independent  of  the  next  variable  given  its  parents,  we  can 
compute 

P(hi\Ho)=0.9 

P(Xl\H0)=  P(x1\hI,H0)P(h1\H0)  +  (P(Xl\h2,H0)P(h2\H0) 

=  P(X!  fOPfi^ho)  +P(xi  I  h2)P(h2\H0) 

=  (0.7)(0.9)  +  (0.4)(0.1)  =0.67 

P(x2\H0)=  P(x2\h2,H0)P(h2\H0)  +P(x2\h1,H0)P(h1\H0) 

=P(x2\h2)P(h2\H0)  +  P(x2\h1)P(h1\H0) 

=(0.6)  (0.1)  +(0.4)(0.9)=0.42 


P(S!\H0)  =  P(Si\xi,h  ,)P(x,\  H0)+P(S, \x2,h2)P(x2 1 H0) 
=  P(S, \x,)P(x,  |  H0)  +P(S,  |  x2)P(x2  \Ho) 

=  (0.8)(0.67)  +  (0.6)(0.42)=  0.734 


Applying  abductive  inference,  we  can  compute 


P(xi\S])  = 


P(S1jx1)P(x}) 

P(St) 


(0.5X0.652)  _ 
0.5348 


To  compute  P(h]\S{),  we  again  apply  Bayes  theorem 


P(h,\S,) 


P(S,) 


But  we  need  to  first  compute  the  P(Si\h{).  That  is 

P(S,\h,)  =  P(S1\x,)P(x,\h,)P(S1\x2)  +  P(S1\x2)P(x2\hI)P(x2\h2) 

=  (0.5)  (0.7)  (0.6)  +  (0.6)(0.3)(0.6)  =0.318 

P(hi\Si)  =  0.504 

We  then  compute  the  probability  P  (S/\H0)  and  P(H0\Si)  in  a  sequence  as  follows 

P(S,\H0)  =  P(S,\h ,)  P(h  / 1  H0)+P(S,  |  h2)P(h2\H0) 

=  (0.5 3)  (0.9)+ (0.47)  (0.1)  =  0.524 

The  value  of  0.524  gives  the  numerical  probability  that  we  may  assign  to  our  degree  of 
belief  that  event  Si  will  happen  given  a  world  in  which  the  hypothesis  H0  holds  plus  all 
the  other  instantiated  variables.  Referring  to  our  fictitious  scenario  network,  we  can  say 
with  a  52%  certainty  that  the  end  state  represented  by  hypothesis  Ha  will  influence  event 
Si.  In  terms  of  prospective  sensemaking  Si  is  therefore  the  most  probable  explanation  for 
hypothesis  H0. 


Again,  by  using  Bayes  theorem 

w_  ns,\HjnHj  _  (0.524X0.4)  _ 

01  P(5, )  0.53 

Similarly,  given  the  influence  path  S\  ,we  can  perform  a  backward  inference  and  say  that 
S/  will  influence  the  desired  end  state  1 1()  only  39%  of  the  time  (i.e.,  probably  not  a  very 
significant  influence  path  for  this  hypothesis).  This  backward  inference  corresponds  to 
the  consequent— » antecedent  reasoning  or  the  retrospective  sensemaking  of  the  network 
scenario. 


Considering  the  network  shown  in  Figure  (1)  above 


P(mi)  =  YjP(m i  |  S1,S2,S3,..Sr) 

S,  ,..sr 

Because  of  the  independence  of  {Si,  S2,  S3...S1},  we  can  write 
P(m,)=  £P(m,l  S,.Sr)P(S,)P(S2)P(S,)...P(SJ 

S,..Sr 


P(m  1)  =  P(mx  |  SX)P(SX)  +  P(mx 


S2)P{S2)  +  P(mx 


S3)P(S3) . P(Wl|5r)P(5r) 


Clearly,  the  complexity  of  the  computation,  even  for  a  relatively  simple  network  can  be 
seen.  When  new  evidence  is  introduced,  the  analyst  is  interested  in  determining  the 
possible  effects  on  his  most  probable  hypothesis,  Ha.  Suppose  the  new  evidence  points  to 
a  new  target  to  be  exploited  by  the  insurgents.  The  new  target  may  be  a  coalition 
command  and  control  (C2)post  in  a  previously  secure  part  of  the  country.  This  would 
definitely  require  a  level  of  sophistication,  challenging  the  analyst’s  previous  hypothesis 
about  the  end  state  of  the  insurgency.  Using  a  Bayesian  abduction  inference,  we  can 
compute  the  state  of  the  network  with  variable  Xt  instantiated  as  follows: 


P{H0\X,) 
P(Xj  |  Ho ) 

P(Xt\H0) 


P(Xj  |  H0)P(H0) 

P(Xt) 

Yj(Xi\hn,Ho)P(hn\Ho) 

hx..hn 

P(X,  \hl,Ho)P(hl  \Ho)  +  P(Xx  \h2,Ho)P(h2  \H0) . P{Xx\hn,H0)P{hn  \H0) 

P(Xx\hx)P(h,  \H0)  +  P(Xx  \h2)P(h2  \Ha) . P(Xx\hn)P{hn  |  Ha) 


Once  the  state  (solution)  of  the  network  is  determined,  it  is  straightforward  to  perfonn 
forward  or  backward  inference.  It  is  easy  to  see  also  that  the  more  complex  the  network, 
the  more  difficult  the  computation.  Unfortunately  abductive  inference  in  belief  networks 
belongs  to  the  class  of  NP-hard  problems  (Cooper,  1990).  Complexity  increases 
drastically  as  a  function  of  the  number  of  undirected  cycles,  discrete  states  per  variable 
and  variables  in  the  network.  Approximate  solution  techniques  which  reduce  calculation 
time  and  generate  rankings  of  possible  hypotheses  have  been  introduced  as  an  alternative 

In  order  to  overcome  the  problem  of  computational  complexity,  the  BAM  uses  a  genetic 
algorithm  (GA)  to  perform  the  search  and  computation  for  the  most  probable  hypothesis. 
GA’s  can  handle  very  complex  network  problems  and  perform  efficient  and  fast 
computation  over  large  search  spaces.  Using  GA,  inference  is  performed  as  a  search  in  a 
large  discrete  multi-dimensional  space  of  competition  hypotheses.  Generally,  GA  can 
conduct  a  search  adaptively  and  thus  facilitates  the  discovery  of  a  hypothesis  path  with  a 
high  probability  instantiations. 


One  major  advantage  of  GA  is  that  we  can  represent  multiple  states  for  each  variable 
depending  on  the  cardinality  that  we  choose  for  the  genetic  coding.  Our  GA  model  uses 
probabilistic  transition  rules  to  propagate  search  along  the  direction  of  “best”  fit  in  the 
Bayesian  network,  making  use  of  Bayesian  characteristics  that  conditionally  explore  or 
prune  nodes  based  on  their  probabilistic  scores. 

The  first  step  in  applying  GA  to  our  BAM  model  is  to  code  all  the  variables  in  our 
hypothetical  network  as  a  finite  length  string.  The  simplest  scheme  is  to  use  two-variable 
cardinality  so  that  the  set  {0,1}  is  sufficient  to  represent  all  the  states  of  the  variables.  At 
any  instance,  the  state  of  the  network  can  be  fully  determined  by  using  a  vector  a,  where 

1  if  a  node  Cig  is  instantiated 

a  =  < 

I  0  otherwise 

At  each  level  k  of  the  network,  we  have  At  nodes  such  that  Ckj  e  Nk,j  -  1,2,3..,  n. 

The  resulting  network  representation  for  all  nodes  is  a  binary  pair  {Cj,  a}  for  all  nodes  k. 
The  initial  population  is  generated  by  coding  each  of  the  variables  with  a  {0,1} 
depending  on  the  state  of  the  instantiation.  The  initial  population  is  then  subjected  to 
genetic  operators  {mutation,  crossover,  reproduction} .  The  fitness  function  to  determine 
reproduction  is  calculated  based  on  classical  Bayesian  operators.  Figure  3  below 
represents  the  network  with  the  instantiated  variables  (nodes)  coded  by  { 1 } .  The 
generated  string  for  all  the  parameters  to  be  manipulated  is  represented  as: 


1 1„  Ha  h  i  h2  h  i  h„  Xj  Xi  Xj  Xk  S /  S?  S3  Sr  m  /  mj  ny 
1  0  1  1  0  0  0  1  0  1  1  0  1  0  1  0  1 


1,0 


Figure  3:  The  network  with  all  the  instantiated  variables  coded  { 1 }  .the  nodes  are  given 
position  coordinates  for  the  search  process 


In  a  previous  work,  Gelsema  (1995)  applied  a  GA  to  abductive  reasoning  in  Bayesian 
belief  networks.  Gelsema  used  a  two  level  network  depicting  a  classical  diagnostic 
problem.  Our  approach  differs  significantly  from  Gelsema’s  approach  in  two  ways. 
Foremost,  Gelsema’s  goal  was  to  find  the  states  of  the  network  (solutions)  with  the 
highest  overall  posteriori  probability.  To  do  this,  the  fitness  function  was 
straightforwardly  calculated  as  a  product  of  n  multipliers,  one  for  each  of  the  n  nodes  in 
the  network.  This  could  be  seen  as  more  of  a  search  for  an  optimal  solution.  The  BAM 
model  does  not  search  for  the  optimal  solution;  rather  it  searches  for  the  most  probable 
outcome  (hypothesis)  given  the  evidence  in  the  prospective  sensemaking  phase  using 
abductive  inference.  In  retrospective  sensemaking,  the  BAM  model  searches  for  the 
evidence,  given  a  probable  outcome  (hypothesis). 

Sample  Results 

To  clarify  the  approach,  using  a  hypothetical  network,  an  array  of  conditional  probability 
tables  was  generated  using  Bayesian  abduction  inference.  The  results  of  the  sample 
calculations  are  shown  in  Table  1  below. 


Table  1:  Sample  calculations  using  MatLab  software 


Array  1;  P(h\H0) 

K  1  ho 

Ha=  1 

Hx  =  hx 

0.8 

H2  =  Hr, 

0.5 

H 3  =  h3 

0.3 

H 4  h 4 

0.9 

Array  2;  P(Xi\hi) 

xi  1  hi 

H  i  =  K 

H  2  ^2 

H3  =  h3 

II 

Xx  =  *, 

0.7 

0.2 

0.6 

0.1 

X 2  ~  X2 

0.3 

0.4 

0.5 

0.8 

X3  =  x3 

0.9 

0.3 

0.6 

0.1 

H 

II 

0.1 

0.9 

0.7 

0.5 

Array  3:  P(Sl\Xj) 

Si  |  xt 

X;  =x: 

X 2  ~  X2 

X3  =  x3 

A4  =x4 

Sl=sl 

0.5 

0.6 

0.9 

0.3 

S  2  =  S2 

0.1 

0.0 

0.5 

0.4 

S  3  =  S3 

0.9 

0.1 

0.3 

0.5 

S4  =  s4 

0.5 

0.6 

0.7 

0.4 

Array  4:  P(nii\Sj) 


Ill  . 

\st 

s  ,=* 

(N 

II 

<N 

Co 

S  3  =  s3 

II 

■''t 

M, 

=  111 ! 

0.6 

0.3 

0.8 

0.1 

m2 

=  m2 

0.3 

0.5 

0.4 

0.9 

m3 

=  m3 

0.1 

0.9 

0.2 

0.6 

The  variable  names  in  the  arrays  are  replaced  with  the  position  coordinates  representing 
the  variables.  When  a  new  infonnation  arrives  to  the  analyst,  the  corresponding 
information  a  variable  is  either  defined  or  instantiated,  and  coded  by  a  { 1 }  in  the  string. 
The  GA  model  then  performs  the  abductive  inference  by  perfonning  the  computation  for 
all  possible  states  of  the  instantiated  network  variables  and  giving  the  approximate 
inference.  The  result  is  then  output  as  the  most  probable  explanation. 

Figure  4  illustrates  the  sample  results  using  1000  generations  from  a  genetic  algorithm. 
The  graph  shows  how  the  most  probable  outcome  varies  as  we  manipulate  the  value  of 
one  variable  hi  .For  example  if  the  analyst  believes  there  is  a  70%  chance  that  the 
Operational  Focus  of  the  adversary  is  node  hi  then  there  is  a  30%  chance  that  the  targeted 
node  is  m3.  If  on  the  other  hand  the  analyst  has  reason  to  totally  discount  the  possibility  of 
the  Operational  Focus  being  node  hi  (in  other  words, 0%  chance  for  node  A;), then  the 
node  with  the  highest  probability  of  being  targeted  would  be  m2  (  26%  chance).  Notice 
also  that  with  a  30%  chance  of  occurrence  for  node  hi  both  mi  and  m3  are  equally  likely 
targets.  If  the  probability  of  hi  occurring  is  increased  to  0.4  then  both  mi  and  m2  are 
equally  likely  targets.  With  hi  instantiated  with  probability  0.35,  m2  and  m3  are  equally 
likely  to  be  targeted  and  it  would  be  left  to  the  analyst  to  look  at  other  contributing 
factors  before  making  further  inference.  Figure  5  is  a  Venn  diagram  to  capture  the  above 
result  explanations.  Similarly,  backward  inferences  can  be  made,  starting  with  apriori 
probabilities  for  the  targets  and  inferring  most  probable  outcomes  for  any  of  the  other 
network  variables. 

Conclusion 

In  this  paper  we  have  presented  a  computational  model  of  adductive  inference  using 
Bayesian  techniques.  We  use  a  genetic  algorithm  to  solve  a  BAM  directed  information 
fusion  problem  that  deal  with  a  multiple  hypotheses  sensemaking  problem.  By  using  a 
constructive  information  network  from  Iraq  conflict,  we  demonstrate  our  model  in  terms 
of  robustness  when  compared  to  the  traditional  Bayesian  model  alone.  Sample  simulation 
experiments  with  a  small  information  network  were  used  to  demonstrate  the  model 
efficacy.  The  BAM  model  is  still  being  refined  and  future  tasks  include  developing  a  user 
interface  for  the  BAM  that  can  be  used  by  intelligence  analysts  and  comparing  the  current 
results  to  decision  tree  approaches. 


Figure  4:  A  graph  showing  a  sample  GA  run.  Variable  hi  is  instantiated  for  different 
values  and  the  resultant  steady  state  probabilities  of  variable  m3  are  displayed. 


Figure  5:  Solution  space  showing  the  feasible  solutions  for  the  sample  run  in  figure  4 

above 
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Motivation 


•  Asymmetric  battlespace  environments  call  for  strategy  rethink 

-  Complex  and  “wicked”  environment 

-  Disparate  information  sources  coupled  with  Uncertainty, 
ambiguity  and  dynamicity 

•  Deliberate  MDMP  is  not  sufficient 

-  Linearity  assumptions  for  non-linear  asymmetric  situations 

•  Generating  COA  must  be  progressive  and  opportunistic 

-  Recalibration  of  the  usual  prescriptive-normative  models  of  judgment 
and  choice  to  fight  unknown  adversaries 

•  SENSEMAKING:  Precursor  to  MDMP 

-  “Connecting  dots”  to  disparate  information 

-  Seeking  explanations  to  unexpected  evolving  situations 

-  Dynamic  re-planning  and  re-tasking  based  on  prospective  and 
retrospective  analysis 
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Sensemaking  in  the  Context  of  C2 


•  How  battle  staff  reduce  uncertainty  or  ambiguity  during  decision 
making  processes 

•  Aggregation  of  fragmentary  battle  space  information  (deriving 
meaning  from  fragmentary  cues) 

•  Dynamic  re-planning  and  re-tasking  to  account  for  the  evolving 
asymmetric  battlespace  environments 

•  Aiding  the  commanders  situation  awareness  by  capturing  the 
evolving  states  of  battle  dynamics,  the  information  equivocality  and 
the  commander’s  intent. 
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Illustration  by  Example 


A  hypothetical  scenario:  Analyzing  the  Iraq  insurgency 

•  The  Battlestaff  start  out  with  various  hypotheses  regarding  a 
perceived  desired  end  state  of  an  insurgent  operation,  H0 

•  To  achieve  this  end  state,  the  insurgents  have  various  operational 
foci,  /?,. Examples  of  this  could  be  funneling  money  and  weapons  to 
a  particular  cell,  attacking  soft  targets  to  draw  out  coalition  forces 
etcetera. 

•  For  each  operational  focus  there  is  a  motive  X ]  or  motives  that 
avail  themselves  to  the  insurgents 

•  The  operational  focus  and  the  motivation  are  uniquely  effected  by 
a  pre-identified  influence  pathway,  S,,  The  influence  pathway  is  a 
unique  action  or  sets  of  actions  that  will  be  used  to  influence 
operations  to  achieve  the  desired  end  state 
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Illustration  by  Example 


•  In  this  case,  an  influence  pathway  could  be  the  use  of  inflammatory 
religious  sermons,  political  pressure-AI  Sadr  withdrawing  from  the 
unity  government,  arming  militias,  etcetera. 

•  For  each  of  the  unique  influence  pathways  there  is  a  specific  set  of 
targets,  m ,  to  be  attacked  and  targeted  actions  designed  to 
collectively  bring  about  the  desired  end  state  (Mosques,  Bridges, 
Coalition  Ops  Bases,  Kidnappings,  etcetera) 

-  Figure  1  illustrates  this  example 


12th  ICCRTS,  Newport,  Rl,  June  2007 


North  Carolina  Agricultural  &  Technical  State  University 


Illustration  by  Example 


Figure  1 


Operational 

Focus 


Motivation 


Influence 

Pathways 
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Illustration  by  Example 


Construct  a  network  to  represent  all  the  variables  in  the 
scenario. 

Issues  for  analytical  sensemaking: 

•  For  a  simple  hypothetical  scenario  note  the  multiplicity 
of  causal  linkages!! 

•  Complexity  increases  with  increasing  variables;  in  real 
life  battle  space  environments  we  expect  a  large 
number  of  variables  and  multiple  linkages;  We  may  not 
even  be  able  to  identify  all  of  them;  Some  are 
interrelated,  some  are  latent 
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Illustration  by  Example 


Of  interest  for  C2  sensemaking: 

•  What  happens  when  new  information  arrives  to 
the  intelligent  analyst? 

•  How  does  the  network  behave? 

•  What  variables  are  affected? 

•  Are  the  effects  serious  enough  to  warrant 
immediate  changes  in  the  existing  COA? 
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Illustration  by  Example 


•  Examples:  The  adversaries  change  their  attack  methods  (armor 
penetrating  IEDS); 

•  What  is  the  most  likely  target ? 

•  What  is  the  influencing  factor?  (Sourced  from  Iran?); 

•  What  is  the  likely  change  in  operational  focus?(  From  soft  targets 
to  armored  coalition  patrols). 

•  Does  it  represent  an  operational  shift  from  low  level  attritional 
attacks  to  bold  guerilla  style  hit  and  run  tactics? 

•  If  so,  what  end  state  does  the  adversary  hope  to  achieve  by 
focusing  on  these  particular  variables? 
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Bayesian  Abduction  Model 


•  The  existing  COA  and  planning  models  not  flexible  to  handle  the 
types  of  scenario  described  above 

•  We  have  proposed  the  Bayesian  abduction  model  that  combines 
sensemaking  with  Peircean  abduction  reasoning  to  model 
complex  situations  where  information  ambiguity,  equivocality  and 
dynamicity  are  dominant. 

•  Using  this  model,  an  intelligence  analyst  is  able  to  fuse  information 
from  disparate  sources  in  real  time  to  identify  variables  and  causal 
links  of  interest  from  the  multiplicity  of  factors  in  the  complex 
battlespace  environment. 

•  The  analyst  can  then  use  abductive  reasoning  to  form  plausible 
explanations  for  the  situation  of  interest 


12th  ICCRTS,  Newport,  Rl,  June  2007 


North  Carolina  Agricultural  &  Technical  State  University 


Bayesian  Abduction  Model 


Features: 

•  Generates  a  list  of  exhaustive  and  mutually  exclusive  hypotheses 
regarding  a  scenario  of  interest. 

•  Represents  all  the  variables  of  interest  in  the  scenario  as  nodes  to 
generate  a  belief  network.  Links  from  a  parent  node  to  a  child 
node  are  causal  links. 

•  Uses  Bayesian  analysis  to  evaluate  all  the  possible  states 
(solutions)  for  the  network. 

•  Applies  Peircean  abduction  reasoning  to  infer  to  the  best 
explanation.  (E  is  your  collection  of  evidence;  Hypothesis  /7, 
explains  E  ;No  other  explanation  explains  E as  well  as  /?,;  therefore 
hj  is  probably  correct) 
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Bayesian  Abduction  Model 


•  Uses  Genetic  algorithm  (GA)  to  perform  fast  and  efficient  search 
for  plausible  alternatives  presented  as  possible  states  of  the 
network 

•  The  analyst  makes  a  judgment  call  based  on:  How  strong  h ,  as 
compared  to  other  alternatives;  independent  of  all  h,  how  good  is  h , 
?How  confident  are  you  in  the  accuracy  of  E  ?;  How  thorough  is 
the  search  for  other  plausible  alternatives  ?. 
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Bayesian  Abduction  Model 


Generate  initial  population: 
(initial  sampling) 

Instantiation  done  according  to  pre¬ 
specified  rules. 


Selection: 

According  to  the  probability  metric, 
set  a  threshold  probability  value 
for  selection 


Reproduction: 

Possible  solutions  are  combined 
(Different  paths  are  taken  to 
evaluate  all  possible  states  of  the 
network) 


Abductive  Inference  for 
sensemaking 
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Bayesian  Probabilistic  Reasoning 


Rationale: 

•  Intelligence  analysts  assign  subjective  conditional  probabilities  to 
variables  of  interest  in  order  to  analyze  their  impact  in  a  given 
scenario. 

•  The  conditional  probabilities  are  based  on  the  “belief  state”  of  the 
analyst,  not  classical  probability. 

•  The  analyst  starts  of  by  assigning  a  conditional  probability  to 
hypothesis  h  apriori  based  on  his/her  expertise  and  knowledge. 
Upon  obtaining  some  new  evidence  D,  the  apriori  epistemic  state  P 
(state  of  knowledge)  is  revised  by  Bayes  theorem  into  a  conditional 
probability  given  by 
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Bayesian  Probabilistic  Reasoning 


•  P(h)  denotes  the  initial  probability  that  hypothesis  h  holds  , before 
we  incorporate  any  new  data. 

•  P(D)  denotes  the  probability  that  evidence  data  D  will  be 
observed.  P(D)  represents  the  probability  of  evidence  D  given  no 
knowledge  about  which  hypothesis  holds. 

•  P(D\h)  denote  the  probability  of  observing  data  D  given  some  world 
in  which  hypothesis  h  holds 

•  We  are  interested  in  the  probability  P(h\D)  that  h  holds  given  the 
observed  data  D 
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Peircean  Abduction  Reasoning 


A  process  of  reasoning  that  tries  to  form  a  plausible  explanation  for 
new  and  anomalous  data. 

•  Classification  of  a  given  data  set  into  potentially  relevant 
elementary  explanatory  hypotheses. 

•  Given  an  observation  d  and  the  knowledge  that  h  causes  d,  it  is  an 
abduction  to  hypothesize  that  h  occurred. 

•  Given  a  proposition  q  and  the  knowledge  that  p^q,  it  is  an 
abduction  to  conclude  p. 

•  Is  inherently  uncertain  since  information  or  data  supporting 
abduction  process  is  dynamic  in  nature,  leading  to  human 
construction  of  multiple  and  often  competing  hypotheses. 
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Modeling  Approach 


We  have  a  certain  problem  space  or  world  P(w)  comprising  of  certain 
events  of  interest  P(E). 


-  Let  P(w)  =  ZP(E)  where  E  is  an  explanation  of  world  W 
Assuming  independent  events 


^=rL^) 


P(w  |  E)  = 


P(w&E ) 


P(E) 


The  Abduction  process  in  sensemaking  is:  Given  E ,  explain  E,  then 
try  to  infer  i/i/from  these  explanations 


Extend  the  model  to  account  for  uncertain  information.  An  uncertain 
consequence  corresponds  to  an  event  E,  along  with  the  probability  a 
that  E  did  not  happen, 


P(w  |  E,a)  =  aP(w  \  E)  +  (1  -  a)P(w  \  E ) 
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Modeling  Approach 


•  In  the  case  of  a  set  of  alternatives  E„  /=  1,2...n ,  one  of  which  is 
true,  we  extend  the  above  equation  thus 

P(w  I  -  Yjaip(w  I £/) 


•  Formulate  the  problem  as  a  belief  network  showing  all  the  causal 
linkages  together  with  the  associated  conditional  probabilities. 

•  Once  the  state  of  the  network  is  determined  with  all  the  instantiated 
variables  determined,  it  is  straightforward  to  perform  backward  or 
forward  inference. 

•  Use  a  fast  search  algorithm  such  as  the  genetic  algorithm  (GA)  to 
perform  the  search  and  computation  for  the  most  probable 
hypothesis-Abductive  inference  in  belief  networks  is  NP-hard;  The 
more  complex  the  network,  the  harder  the  computation. 


12th  ICCRTS,  Newport,  Rl,  June  2007 


North  Carolina  Agricultural  &  Technical  State  University 


Modeling  Approach 


•  A  Genetic  algorithm  is  an  adaptation  procedure  based  on  the 
mechanics  of  natural  genetics  and  natural  selection.  GA’s  search 
from  a  population,  not  a  single  point  and  use  randomized  operators 
as  opposed  to  deterministic  rules. 

•  GA’s  can  handle  very  complex  network  problems. 

-  Perform  fast  and  efficient  computation  over  large  search  spaces 

-  Inference  is  performed  as  a  search  in  a  large  discrete  multi¬ 
dimensional  space 

-  Adaptive  search  facilitates  the  discovery  of  network  states  with  high 
probability  instantiations 

-  Represent  multiple  states  for  each  variable  depending  on  the 
cardinality  we  select  for  the  genetic  coding. 
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Simulation 


Consider  the  hypothetical  scenario  previously  described 

•  Code  all  the  variables  as  a  finite  length  string  (in  this  case, 
cardinality  2  so  that  the  set  {0,1}  is  sufficient  to  represent  all  the 
states  of  the  variables) 

•  At  any  instance,  the  state  of  the  network  is  fully  determined  by  a 
vector  a,  where 


a= 


1  if  a  node  Ckj  is  instantiated 
.0  otherwise 


•  The  resulting  network  representation  for  all  nodes  is  a  binary  pair 
{Cj,  a}  for  all  nodes  k. 

•  The  initial  population  is  generated  by  coding  each  of  the  variables 
with  a  {0,1}  depending  on  the  state  of  the  instantiation 

See  Figure  2 
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Simulation 
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Simulation 


•  Subject  the  initial  population  to  genetic  operators  {mutation, 
crossover,  reproduction} 

•  The  fitness  function  to  determine  propagation  is  calculated  based  on 
the  defined  Bayesian  operators 

•  Start  by  assigning  some  apriori  conditional  probabilities  such  as 

P(H„)  =  0A 

Implying  we  are  only  40%  confident  that  our  chosen  hypothesis 
regarding  the  end  state  is  plausible. 

•  Similarly  prior  probabilities  of  all  instantiated  variables  can  be 
determined  by  straightforward  application  of  Bayes  theorem,  for 
example 
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Sample  Results 
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Sample  Results 


Sample  GA  run.  Variable  h1  is  instantiated  for  different  values  and  the 
resultant  steady  state  probabilities  of  variable  m,  are  displayed. 
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•  The  graph  shows  how  the  most  probable  outcome  varies  as  we 
manipulate  the  value  of  one  variable  h1  .For  example  if  the  analyst 
believes  there  is  a  70%  chance  that  the  Operational  Focus  of  the 
adversary  is  node  h1  then  there  is  a  30%  chance  that  the  targeted 
node  is  m3. 

•  If  0%  chance  for  node  /7^then  the  node  with  the  highest  probability 
of  being  targeted  would  be  m2  (  26%  chance). 

•  Notice  also  that  with  a  30%  chance  of  occurrence  for  node  h1 
both  m1  and  m3  are  equally  likely  targets 

•  If  the  probability  of  h1  occurring  is  increased  to  0.4  then  both  m1 
and  m2  are  equally  likely  targets.  In  this  case,  it  is  left  to  the  analyst 
to  look  at  other  contributing  factors  before  making  inference 

See  Venn  Diagram  in  Figure  4 
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Sample  results 


Figure  4 


Solution  space  showing  the  feasible  solutions  for  the  sample  problem 


12th  ICCRTS,  Newport,  Rl,  June  2007 


North  Carolina  Agricultural  &  Technical  State  University 


•  This  paper  proposes  an  analytical  sensemaking  model  to  aid  the 
C2  decision  making  process  that  combines  Bayesian  formalism 
with  Peircean  abduction  reasoning  . 

•  The  Bayesian  abduction  model  (BAM)  has  been  implemented 
using  GA  .The  developed  model  and  algorithms  will  improve  the 
design  of  sensemaking  support  systems  for  the  Future  Combat 
Force 

•  The  aim  of  the  modeling  process  is  twofold:  Foremost, 
retrospectively  discovering  or  identifying  variables  or  combinations 
therefore  that  can  adequately  explain  observed  adversary  COA 
and  secondly;  Identifying  variables  and  causal  linkages  that  can 
aid  in  predicting  an  adversary’s  set  of  COA. 

•  The  model  provides  an  advantage  to  information  fusion  in  a  system 
characterized  by  dynamicity  and  complexity — evolving  system 
states. 
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